Practical Implications of Using Different Tests of Measurement Invariance for Polytomous Measures
Abstract
Using male/female and Caucasian/African American comparison groups, this study examined the practical ramifications of using two IRT-based analytic methods, differential functioning of items and tests (DFIT) and the likelihood ratio test (LRT), to assess the measurement invariance of a 21-item leadership development scale under ten sample size conditions (e.g., 200, 500, and 1,000). In nine of the ten conditions, the LRT identified multiple items exhibiting differential item functioning (DIF), whereas DFIT detected only a single item with DIF in one set of analyses. Conclusions based on the LRT indicated a lack of measurement invariance for the scale, while DFIT implied near-perfect measurement invariance. These findings highlight how the choice of analytic method affects the determination of measurement invariance in applied samples.
Related Resources
The effects of the violation of local independence assumption on the person measures under the Rasch model
Local independence of test items is an assumption in all Item Response Theory (IRT) models. That is, the items in a test should not be related to each other. Sharing a common passage, which is prevalent in reading comprehension tests, cloze tests and C-Tests, can be a potential source of local item dependence (LID). It is argued in the literature that LID results in biased parameter estimation ...
Same Question, Different Answers: CFA and Two IRT Approaches to Measurement Invariance
The effectiveness of confirmatory factor analytic (CFA) and item response theory (IRT) methods of assessing measurement invariance was investigated using simulated data with a known lack of invariance. Across all study conditions, IRT likelihood ratio (LR) tests consistently outperformed both CFA and IRT differential functioning of items and tests (DFIT) analyses in terms of detecting a lack o...
Sensitivity of DFIT Tests of Measurement Invariance for Likert Data
Likert scales are routinely used in educational and psychological research as measures of constructs of interest. If sound scale development procedures are followed, the resulting scale can reliably and validly measure a construct. However, if a given scale is used to make comparisons among different populations of respondents (e.g., cultures; Riordan & Vandenberg, 1994), over time in longitudi...
Factorial invariance of the Brief Symptom Inventory-18 (BSI-18) for adults of Mexican descent across nativity status, language format, and gender.
The cultural equivalence of psychological outcome measures remains a major area of investigation. The current study sought to test the factor structure and factorial invariance of the Brief Symptom Inventory-18 (BSI-18) with a sample of adult individuals of Mexican descent (N=923) across nativity status (U.S.- vs. foreign-born), language format (English vs. Spanish), and gender. The results sho...
Sample Size and Tests of Measurement Invariance
Measurement equivalence/invariance (ME/I) can be thought of as operations yielding measures of the same attribute under different conditions (Horn & McArdle, 1992). These different conditions include stability of measurement over time (Golembiewski, Billingsley, & Yeager, 1975), across different populations (e.g., cultures; Riordan & Vandenberg, 1994), different mediums of measurement administr...